Gene Expression Profile Changes After Short-activating 
RNA-mediated Induction of Endogenous Pluripotency 
Factors in Human Mesenchymal Stem Cells 

Jon Voutila, Pal Saetrom, Paul Mintz, Guihua Sun, Jessica Alluin, John J Rossi, Nagy A Habib and Noriyuki Kasahara

It is now recognized that small noncoding RNA sequences have the ability to mediate transcriptional activation of specific target 
genes in human cells. Using bioinformatics analysis and functional screening, we evaluated short-activating RNA (saRNA) 
oligonucleotides designed to target the promoter regions of the pluripotency reprogramming factors, Kruppel-like factor 4 
(KLF4) and c-MYC. We identified KLF4 and c-MYC promoter-targeted saRNA sequences that consistently induced increases 
in their respective levels of nascent mRNA and protein expression in a time- and dose-dependent manner, as compared with 
scrambled sequence control oligonucleotides. The functional consequences of saRNA-induced activation of each targeted 
reprogramming factor were then characterized by comprehensively profiling changes in gene expression by microarray 
analysis, which revealed significant increases in mRNA levels of their respective downstream pathway genes. Notably, the 
microarray profile after saRNA-mediated induction of endogenous KLF4 and c-MYC showed similar gene expression patterns 
for stem cell- and cell cycle-related genes as compared with lentiviral vector-mediated overexpression of exogenous KLF4 
and c-MYC transgenes, while divergent gene expression patterns common to viral vector-mediated transgene delivery were 
also noted. The use of promoter-targeted saRNAs for the activation of pluripotency reprogramming factors could have broad 
implications for stem cell research. 
INTRODUCTION 

Small RNA molecules are important in the regulation of various
molecular and biological activities in the cell, and it is now
well known that short RNA sequences play a critical role in
regulating the expression levels of specific genes in a targeted
manner. RNA interference (RNAi) has become widely recognized as
an important gene regulatory mechanism that causes
sequence-specific downregulation of mRNAs. Acting through this
mechanism, double-stranded short interfering RNAs (siRNAs) can
knock down expression levels via the RNA-induced silencing
complex, which mediates degradation or translational inhibition
of the targeted mRNAs. Moreover, in addition to RNA-induced
silencing complex-mediated regulation at the post-transcriptional
level, RNAi can also modulate gene transcription itself. In
fission yeast, homologs of the RNA-induced silencing complex can
regulate chromatin through recruitment of histone-modifying
proteins to loci transcribing small noncoding RNA, a mechanism
also seen in plants, ciliates, nematodes, and flies. Small
promoter-targeted RNAs have also been shown to repress
transcription and induce epigenetic changes in eukaryotic cells
through a mechanism called transcriptional gene silencing.



In human cells, it has recently been reported that short RNAs
targeted to the promoter regions of certain genes can activate
expression at the transcriptional level. This phenomenon has
been called RNA activation (RNAa), and has been shown to be
conserved in other mammalian species, including mouse, rat, and
nonhuman primates. Promoter-targeted small hairpin RNAs have
also been shown to efficiently upregulate genes in vivo. While
the mechanism is not completely understood, it appears that
naturally occurring antisense transcripts arising from or near
the same genetic locus are able to direct recruitment of
Argonaute proteins and histone methyltransferases.
Short-activating RNAs (saRNAs) may regulate transcription by
targeting these antisense transcripts for degradation, resulting
in a reversal of this epigenetic silencing and upregulation of
sense mRNA.

Based on this approach, we have developed a method to design
saRNAs for upregulation of specific cellular genes. In the
present study, we focused on the feasibility and genetic
consequences of upregulating important target genes involved in
stem cell regulation and reprogramming. Combined expression of
the transcription factors Kruppel-like factor 4 (KLF4), POU5F1
(also called OCT3/4), SOX2, and c-MYC has been shown to
reprogram mouse and human fibroblasts into induced pluripotent
stem (iPS) cells. In particular, KLF4 is important for
maintenance of embryonic stem cells, and has been reported to be
a master regulator in embryonic stem cells that controls the
expression of other pluripotency factors including POU5F1, SOX2,
c-MYC, and NANOG. It has previously been reported that direct
reprogramming can be achieved in murine fibroblasts with only
Klf4, Oct4, and Sox2. However, it has recently been shown that
c-Myc is critical for efficient induction of pluripotency in the
early phases of reprogramming, by altering the metabolic state
of the cell.

Accordingly, in the present study, we employed a genomic- 
bioinformatic approach to design saRNAs that specifically tar- 
get two of these key reprogramming genes, KLF4 and MYC. 
These saRNAs were tested for their ability to upregulate the 
targeted reprogramming factors, as well as their respec- 
tive downstream genes, in human mesenchymal stem cells 
(MSCs), adult bone marrow-derived tissue-specific stem cells 
that already have multilineage differentiation potential. The 
effects of saRNA transfection on endogenous gene expres- 
sion profiles were compared with those resulting from lenti- 
viral vector-mediated overexpression of the exogenous KLF4 
and c-MYC transgenes. To date, there have been no studies 
comprehensively examining the cellular gene expression pro- 
files after saRNA-mediated gene activation of endogenous 
genes, particularly pluripotency-related genes, and few stud- 
ies comparing how different reprogramming methods might 
differentially affect various cellular pathways. Our results
indicate that saRNA shows significant potential, both as a
tool for studying stem cell biology and as a safe method to
manipulate stem cell gene expression without altering the
genome.



RESULTS 

Design of saRNA candidate sequences targeting KLF4 
and c-MYC 

To design saRNA candidates for activation of stem cell factors,
we developed a novel bioinformatic approach. The KLF4 gene,
located on chromosome 9 (9q31.2), and the c-MYC gene, located on
chromosome 8 (8q24.21), were our initial targets for activation
(Figure 1a,b). To identify potential antisense transcripts from
the KLF4 and c-MYC loci, we searched the genomic region
surrounding each locus for spliced expressed sequence tags
(ESTs) that mapped to the positive strand. Although it is
usually difficult to determine the transcriptional orientation
of ESTs, orientation can be determined by using splice site
signatures of spliced ESTs. We found no spliced ESTs that
overlapped KLF4, but the scan identified one antisense EST
(DB461753) ~15 kb upstream of KLF4's annotated transcription
start site (TSS). We were also unable to find any spliced ESTs
that overlapped c-MYC, but identified one antisense EST
(BC042052) ~2 kb upstream of c-MYC's TSS. These ESTs were then
further investigated as potential candidates (Figure 1c).

Recent deep sequencing experiments have revealed that antisense
RNAs are often found in the region surrounding TSSs. Therefore,
this region was chosen to design saRNA sequences that targeted
potential antisense transcripts from the promoter region. We
used the antisense sequence 500 nt upstream and downstream from
the TSS (abbreviated KLF4_AS_TSS+/-500 and MYC_AS_TSS+/-500) as
a second target candidate. Our goal was to design short sense
RNAs that could potentially bind and degrade antisense RNAs
generated from the two candidate sequences (ESTs and the regions
surrounding TSSs), with the hypothesis that this binding and
degradation of the antisense RNAs would activate gene
expression. Candidate saRNAs targeting the 500 nt upstream of
the TSS were designated PR1, whereas those targeting the 500 nt
downstream of the TSS were designated PR2. To give effective
antisense targeting and degradation and to minimize off-target
effects, we used the GPboost siRNA design algorithm to identify
potential short RNAs for downregulating the two candidate
sequences.

From the lists of predicted siRNA candidates, we selected the
two most promising non-overlapping siRNA target sites on the
antisense ESTs DB461753 and BC042052, and the most promising
siRNA target site on each side of the KLF4 and c-MYC TSS within
the antisense promoter sequence (KLF4_AS_TSS+/-500 and
MYC_AS_TSS+/-500). The candidate siRNAs were selected based on
the predicted efficacy score from GPboost. We found four
potential saRNA candidates for activating each gene (Figure 1c).
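
The promoter-window step described above can be illustrated with a minimal sketch. It assumes a gene on the plus strand and 0-based, half-open coordinates; the function names and the toy sequence are hypothetical and are not taken from the study, and the GPboost scoring step is not reproduced here.

```python
# Sketch only: function names, coordinates, and the toy sequence are
# hypothetical illustrations, not the pipeline used in the study.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def promoter_windows(tss: int, flank: int = 500) -> dict:
    """Return 0-based, half-open windows around the TSS of a plus-strand gene."""
    return {
        "PR1": (max(0, tss - flank), tss),  # `flank` nt upstream of the TSS
        "PR2": (tss, tss + flank),          # `flank` nt downstream of the TSS
    }


def antisense_windows(plus_strand: str, tss: int, flank: int = 500) -> dict:
    """Return the antisense-strand sequence (5'->3') of each window."""
    return {
        name: reverse_complement(plus_strand[start:end])
        for name, (start, end) in promoter_windows(tss, flank).items()
    }


toy = "AACCGTTAGGCA"
print(antisense_windows(toy, tss=6, flank=3))
# {'PR1': 'ACG', 'PR2': 'CTA'}
```

For a gene annotated on the minus strand, the upstream and downstream windows would swap sides relative to the reference coordinates, so the window arithmetic would need to be mirrored.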



Upregulation of target gene expression by saRNAs 

Reasoning that activation of pluripotency factor gene expres- 
sion might be more readily achieved in adult tissue-derived 
stem cells that retain restricted multilineage potential, we then 
tested whether these saRNA candidates could upregulate 
KLF4 or c-MYC expression, respectively, in primary human 
MSCs derived from adult bone marrow. Target gene expres- 
sion levels were examined by quantitative PCR of reverse-tran- 
scribed mRNA from MSC cultures after transfection of each 
individual saRNA candidate oligonucleotide, as compared with 
transfection with an Alexa Fluor 555- or FAM-labeled negative 
control oligo. A transfection efficiency of >95% was determined 
by flow cytometry and by knockdown with control siRNAs 
using the same transfection conditions (see Supplementary 
Materials and Methods and Supplementary Figures S1 
and S2). Initially, none of the saRNA oligos appeared to show 
any effect on target gene expression after 48 hours. However, 
upon continued transfection with KLF4-PR1 saRNA every 
other day, significant upregulation of KLF4 mRNA (Figure 2a) 
was observed, on the order of 2.5-fold over controls by day 4, 
and reaching approximately fourfold by day 6 of treatment (P 
< 0.01). Target gene mRNA levels after treatment with KLF4- 
PR1 were significantly higher when MSCs were exposed to 
saRNA at concentrations of 25 or 50 nmol/l, as compared with 
5 nmol/l (Figure 2b). Increased Klf4 protein was confirmed in 
MSCs treated with KLF4-PR1 saRNA (Figure 2c, left panel) 
by western blot analysis, and densitometric quantitation of 
these blots showed over threefold upregulation of Klf4 protein 
relative to β-actin internal control (Figure 2c, right panel), 
correlating closely with the level of mRNA upregulation. This time-
dependent upregulation of KLF4 gene expression by the KLF4 
promoter-targeted PR1 saRNA was observed consistently over 
multiple experiments. None of the other three KLF4-targeted 
saRNA candidate sequences was able to upregulate target 
Figure 1 Design of KLF4 and c-MYC saRNA candidates. (a) KLF4 locus and potential antisense target candidates. The schematic 
shows the genomic location of KLF4, the structure of the KLF4 transcript, and the spliced ESTs reported from various cell types in the 
surrounding regions. Red boxes outline the KLF4 promoter region and the closest antisense EST upstream of KLF4 (DB461753). The 
antisense EST DB461753 initiates roughly 15 kb from KLF4's transcription start site (TSS) and terminates more than 25 kb away. Red arrows 
indicate the target sites for the short-activating RNA (saRNA) candidates. (b) MYC locus and potential antisense target candidates. The 
schematic shows the genomic location of MYC, the structure of the MYC transcript, and the spliced ESTs reported from various cell types 
in the surrounding regions. Red boxes outline the MYC promoter region and the closest antisense EST upstream of MYC (BC042052). 
The antisense EST BC042052 initiates roughly 2 kb from MYC's TSS and terminates 50 kb away. Red arrows indicate the target sites for 
the saRNA candidates. (c) saRNA candidates for KLF4 and MYC genes. The list shows the most promising saRNAs against the antisense 
ESTs DB461753 and BC042052, and saRNAs targeting KLF4 or MYC sequences within a stretch of 500 bp either upstream or downstream of 
the TSS for each gene. EST, expressed sequence tag; n/a, not applicable. 



gene expression; conversely, KLF4-DB1 appeared 
to reduce KLF4 mRNA levels in MSCs to about 60% of that in 
scrambled oligo-treated controls by day 6. 

Similarly, we found that among the four saRNA candidate 
sequences targeted to the c-MYC promoter and antisense 
ESTs, both MYC-PR1 and MYC-PR2 were able to induce 
consistent upregulation of MYC mRNA (Figure 2d) by up 
to 1.8-fold (P < 0.01) and 1.6-fold (P < 0.01), respectively, 
during the 6-day interval of treatment, as compared with 
scrambled sequence controls. While MYC-BC1 and MYC- 
BC2 also appeared to affect MYC mRNA levels to some 
extent, these effects were more modest, and were not con- 
sistent between days 4 and 6. Again, target gene mRNA 
levels after treatment with MYC-PR1 and MYC-PR2 were 
significantly higher when MSCs were exposed to saRNA 
at concentrations of 25 or 50 nmol/l, as compared with 5 
nmol/l (Figure 2e). Image densitometry analysis of western 
blots from MSCs treated with MYC-PR1 or MYC-PR2 again 
confirmed upregulation of c-Myc protein by over 2.5-fold 
relative to β-actin internal control (Figure 2f). All western 
blots were performed in triplicate with replicates shown in 
Supplementary Figure S3. 
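
The fold changes reported above are relative mRNA levels from quantitative PCR. This section does not state how they were calculated; a common choice is the comparative Ct (2^-ΔΔCt) method, which is sketched below purely as an assumption. The Ct values, the reference gene, and the function name are hypothetical, and the formula presumes roughly 100% amplification efficiency for both target and reference.

```python
# Sketch under stated assumptions: comparative Ct (2^-ddCt) quantification.
# All Ct values below are invented for illustration.
def relative_expression(ct_target_treated: float, ct_ref_treated: float,
                        ct_target_control: float, ct_ref_control: float) -> float:
    """Fold change of a target gene in treated versus control cells.

    Each sample is first normalized to a reference gene (dCt), then the
    treated sample is compared with the control sample (ddCt).
    """
    dct_treated = ct_target_treated - ct_ref_treated
    dct_control = ct_target_control - ct_ref_control
    ddct = dct_treated - dct_control
    return 2.0 ** (-ddct)


# The target crosses threshold 2 cycles earlier in treated cells while the
# reference gene is unchanged, which corresponds to a fourfold increase.
print(relative_expression(24.0, 18.0, 26.0, 18.0))  # 4.0
```

Under this scheme, a fourfold upregulation such as that seen with KLF4-PR1 at day 6 would correspond to a ΔΔCt of about -2 cycles.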



To determine whether this upregulation of KLF4 and MYC 
is due to true transcriptional activation, expression levels of 
nascent RNA were assessed. MSCs were pulsed with ethy- 
nyl uridine (EU) during saRNA treatment and total RNA was 
isolated. Newly transcribed EU-RNA was separated from 
total RNA by biotinylation of EU in a copper-catalyzed "click" 
reaction, followed by purification on streptavidin magnetic 
beads. Quantitative PCR of reverse-transcribed nascent RNA 
showed that the level of nascent KLF4 mRNA was signifi- 
cantly upregulated with KLF4-PR1 treatment across a 6-day 
timecourse (Figure 3a). The level of nascent MYC mRNA was 
significantly upregulated with MYC-PR2 treatment, whereas 
no significant change in nascent MYC mRNA expression was 
seen with MYC-PR1 treatment (Figure 3b). 

To verify that these saRNAs do not act through interferon 
response pathways, interferon response gene expression 
levels were assayed. No significant upregulation of interferon 
response genes was detected 24 hours after saRNA treat- 
ment (Figure 3c). In contrast, overexpression of KLF4 and 
c-MYC by lentiviral transduction showed dose-dependent 
induction of interferon response genes (Supplementary 
Figure S4). 
Figure 2 Screening of KLF4 and c-MYC short-activating RNA (saRNA) candidates in mesenchymal stem cells (MSCs). (a) RT-qPCR 
of KLF4 in MSCs treated with KLF4 saRNAs, showing an increase of KLF4 with KLF4-PR1. (b) RT-qPCR of KLF4 for KLF4-PR1-treated MSCs 
at the indicated saRNA concentrations for 6 days. (c) Western blot for Klf4 and β-actin protein and relative quantitation in control- and 
KLF4-PR1-treated MSCs. (d) RT-qPCR of MYC in MSCs treated with MYC saRNAs, showing an increase of c-MYC with MYC-PR1 and MYC-PR2. (e) 
RT-qPCR of MYC for MYC-PR1- and MYC-PR2-treated MSCs at the indicated saRNA concentrations for 6 days. (f) Western blot for 
c-Myc and β-actin protein and relative quantitation of c-Myc in control-, MYC-PR1-, and MYC-PR2-treated MSCs. Asterisks indicate 
statistical significance: *P < 0.05; **P < 0.01. con, control; RT-qPCR, reverse transcription-quantitative PCR. 
Figure 3 Mechanism of activation by KLF4- and c-MYC-targeted 
short-activating RNAs (saRNAs). (a) RT-qPCR results 
from KLF4-PR1 saRNA-treated mesenchymal stem cells (MSCs), 
showing increases in newly transcribed KLF4 mRNA compared to 
total RNA. (b) RT-qPCR results from MYC-PR1- and MYC-PR2-treated 
MSCs showing increases in newly transcribed MYC mRNA 
compared to total RNA for MYC-PR2. Asterisks indicate statistical 
significance: *P < 0.05; **P < 0.01. (c) RT-qPCR results from KLF4- 
and MYC-saRNA-treated MSCs, showing no significant increase 
in mRNA levels of key interferon response genes. RT-qPCR, 
reverse transcription-quantitative PCR. 

To further elucidate the mechanism of activation by saRNA, 
the presence of promoter-associated antisense RNAs was 
investigated using 5' Rapid Amplification of cDNA Ends 
(RACE). Since any antisense RNAs involved in regulating 
gene expression may not be polyadenylated, random 
hexamers were used to prime cDNA synthesis from total MSC 
RNA. Antisense strand-specific primers matching the KLF4- 
PR1, MYC-PR1, and MYC-PR2 saRNA sequences were 
used for 5' RACE to amplify potential antisense RNAs that are 
targeted by each saRNA and may be involved in regulation 
of expression. RACE reactions were run on an agarose gel, 
followed by purification, cloning, and sequencing of any prod- 
ucts (See Supplementary Materials and Methods). Despite 
the presence of bands from each saRNA primer, each was 
identified to be a known mRNA with homology to the 3' end of 
the saRNA sequence, likely a result of mispriming at permis- 
sive annealing temperatures (Supplementary Figure S5). 
Repeats of this assay with more restrictive annealing tem- 
peratures yielded no products (data not shown). 

To further rule out involvement of possible antisense RNAs 
arising downstream of each saRNA target sequence, cDNA 
was synthesized from total MSC RNA using antisense strand- 
specific primers for KLF4-PR1, MYC-PR1, and MYC-PR2. 
This cDNA was used as template for PCRs using primers 
downstream of the saRNA target sequence, which would be 
in the direction of the 5' end of any putative antisense tran- 
script. No antisense transcripts were amplified up to 525 bp 
from the KLF4-PR1 target site (Supplementary Figure S6a). 
Of the priming sites up to 667 bp from MYC-PR1, including 
and surpassing the MYC-PR2 target site, two products were 
amplified, cloned, and sequenced corresponding to the 63 
and 136 bp downstream of MYC-PR1 (Supplementary Fig- 
ure S6b). The expression level of this antisense transcript 
was significantly upregulated with MYC-PR2 treatment, 
whereas its upregulation with MYC-PR1 was not statistically 
significant (P= 0.0682) (Supplementary Figure S6c). 

To assess the biological significance of saRNA activation of 
KLF4 and MYC in MSCs, cells were monitored for morpho- 
logical changes during oligo treatment with KLF4-PR1 and 
MYC-PR2, which had been confirmed to be associated with 
increased nascent mRNA transcribed from their respective target 
genes. After 6 days of treatment, KLF4-PR1-treated MSCs 
showed marked differences in cell morphology compared with 
control, whereas MYC-PR2-treated MSCs showed modest changes 
(Figure 4). In contrast to scrambled sequence oligo-treated 
controls, which appeared as progressively more densely packed 
fibroblastic cells over time, KLF4-PR1-treated MSCs were less 
confluent and predominantly arranged in clusters with 
epithelioid cell-like morphology. MYC-PR2-treated cells were 
less confluent than controls but appeared more heterogeneous 
with numerous epithelioid cells containing enlarged nuclei. 

Thus, target gene mRNA levels after treatment with KLF4-PR1 
and MYC-PR2 were significantly higher when MSCs 
were exposed to these saRNA oligos at concentrations 
of 25 or 50 nmol/l over a 6-day treatment period. Hence, 
these promoter-targeted saRNA oligos consistently induced 
increases in both mRNA and protein expression of KLF4 
and c-MYC, respectively, in a time- and dose-dependent 
manner. Neither of these saRNAs activated the interferon 
response, and both appear to act through specific activation 
of transcription. We therefore focused on these two posi- 
tive saRNAs, using the 6-day treatment protocol, in further 
experiments examining how saRNA-mediated upregulation 
might affect downstream targets. 
Figure 4 Morphological changes with saRNA treatment. Phase 
contrast images of MSCs transfected with the indicated saRNA on 
the indicated day of a 6-day time course. MSC, mesenchymal stem 
cell; saRNA, short-activating RNA. 



Gene expression profile analysis after saRNA-mediated 
upregulation of KLF4 and c-MYC 

Microarray analysis was performed to determine the global 
gene expression profile and to investigate possible off-target 
effects with saRNA treatment. Differential gene expression 
profiles after upregulation of endogenous KLF4 and c-MYC 
by treatment with their respective saRNAs versus control 
oligo were examined, and compared with that after overex- 
pression of exogenous KLF4 and c-MYC delivered by viral 
gene transfer using commercially available second-genera- 
tion lentivirus vectors. MSCs were transduced with 0.1 pg p24 
protein per cell as determined by p24 ELISA, a viral dose that 
does not result in significant induction of interferon response 
genes (Supplementary Figure S4). Normalized expression 
data for individual replicates are included in Supplemen- 
tary Table S1. Only those cellular genes showing upward 
or downward changes in their expression levels at a significance 
level of at least P < 0.1, as compared with their levels 
in scrambled sequence oligo-treated controls, were included 
in these analyses. 

Interestingly, analysis of the overall gene expression profile 
in human MSCs after treatment with KLF4-PR1 (Figure 5a) 
and MYC-PR2 (Figure 5b) showed that the majority of cel- 
lular genes exhibited concordant changes in expression, 
but with some notable differences, between upregulation by 
saRNA and overexpression by lentiviral transduction. For 
MSCs treated with KLF4-PR1 saRNA, 68% of the cellular 
genes showing significant changes in their expression levels 
(971 out of 1,429 genes) showed the same pattern of regulation 
as Klf4 lentivirus-transduced MSCs. For MSCs treated 
with MYC-PR2, 64% (273 out of 429 genes) exhibited the 
same pattern of regulation. 
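
The concordance figures above can be made concrete with a short sketch. It assumes that "the same pattern of regulation" means the same direction of change (up or down relative to control) in both conditions; the gene names, log2 fold changes, and function names are hypothetical, while the counts passed to `percent` are the ones reported in the text.

```python
# Sketch only: assumes concordance is judged by the sign of the log2 fold
# change versus control. Gene names and fold changes are invented.
def concordance(sarna_log2fc: dict, virus_log2fc: dict) -> tuple:
    """Return (concordant, total) over genes present in both profiles."""
    shared = sarna_log2fc.keys() & virus_log2fc.keys()
    same_direction = sum(
        1 for gene in shared
        if (sarna_log2fc[gene] > 0) == (virus_log2fc[gene] > 0)
    )
    return same_direction, len(shared)


def percent(part: int, whole: int) -> int:
    """Percentage rounded to the nearest integer."""
    return round(100 * part / whole)


sarna = {"geneA": 1.2, "geneB": -0.8, "geneC": 0.9}
virus = {"geneA": 2.0, "geneB": 0.7, "geneC": 1.1}
print(concordance(sarna, virus))   # (2, 3)

# Counts reported in the text for KLF4-PR1 and MYC-PR2:
print(percent(971, 1429))          # 68
print(percent(273, 429))           # 64
```

Because only genes passing the significance filter enter the comparison, a fold change of exactly zero is not expected; the sketch would count such a gene as "not up".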

However, this indicates that roughly one-third of the cellular 
genes with significant changes in expression after saRNA 
transfection or lentiviral gene transfer showed discordant 
regulation between these two groups. To determine whether 
these differences were due to off-target effects or resulted from 
the different methods used to activate expression, we used 
MetaCore pathway analysis from GeneGo (Carlsbad, CA) 
to determine what pathways were significantly enriched 
in those genes that were differentially regulated between 
saRNA and virus samples. Notably, the pathways that were 
most significantly differentially regulated between KLF4-PR1 
saRNA-treated MSCs and KLF4 virus-treated MSCs were 
those involved in cytoskeletal remodeling, macropinocytosis, 
and regulation of epithelial-to-mesenchymal transition (EMT), 
including transforming growth factor (TGF)-β induction of EMT 
(Table 1). The pathways that were most significantly differ- 
entially regulated between MYC-PR2 saRNA-treated MSCs 
and the c-MYC virus-treated MSCs included cell survival and 
proliferation pathways such as granzyme A signaling, TGF-β 
regulation, and telomere length, as well as macropinocyto- 
sis and EMT pathways (Table 2). That is, for KLF4-targeted 
saRNA versus KLF4-virus, as well as MYC-targeted saRNA 
versus MYC-virus, in both cases genes involved in the same 
pathways were found to be discordantly regulated in the 
same manner. The high degree of similarity in differentially 
regulated pathways observed with two different sets of saRNAs 
versus two different lentivirus vectors suggests that the 
discordant gene expression patterns primarily arise due to 
characteristic cellular changes in response to oligo transfec- 
tion versus viral transduction, rather than off-target effects 
that are shared by each set of saRNAs. 

To determine whether narrowing the focus of the differen- 
tial expression analysis to those types of genes we expect 
to be regulated by KLF4 and c-MYC would yield greater 
similarity, we generated lists of genes involved in stem cell 
maintenance, development, and proliferation, as well as cell 
cycle-related genes from the AmiGO gene ontology database. 
Lists of genes used in this analysis are included in 
Supplementary Tables S2 and S3. Heatmaps visualizing 
the differential expression profiles of KLF4 saRNA-treated 
and KLF4 virus-treated MSCs compared with scrambled 
oligo-treated control MSCs for stem cell-related genes (Figure 
5c) and cell cycle-related genes (Figure 5d) showed much 
greater similarity, with 74% (26 out of 35) exhibiting 
concordant regulation for stem cell-related genes, and 80% (132 
out of 165) exhibiting concordant regulation for cell 
cycle-related genes. Heatmaps focused on cell cycle-related genes 
in MYC-PR2 saRNA-treated MSCs (Figure 5e) compared to 
MYC virus-treated MSCs also showed a higher degree of 
similarity, with 67% of cell cycle-related genes (8 out of 12) 
exhibiting concordant regulation. 

Further DAVID gene ontology (GO) analysis of all genes 
revealed that the most significantly enriched GO terms in the 
KLF4-PR1 saRNA-treated samples were the same as those 
in the KLF4 virus-treated samples (Table 3). Similarly, the 
majority of the most significantly enriched GO terms in the 
MYC-PR2 saRNA-treated samples were also significantly 
enriched in the c-MYC virus-treated samples (Table 4). 
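
To make the "fold enrichment" and P value columns of the GO tables easier to read, the sketch below computes both for a single GO term using a plain one-sided hypergeometric test. This is an illustration under stated assumptions, not DAVID's exact procedure: DAVID reports a more conservative modified Fisher exact statistic, so its values would differ. The counts and function names are hypothetical.

```python
# Sketch only: plain hypergeometric enrichment for one GO term.
# k = genes in the list annotated with the term, n = size of the gene list,
# K = genes in the background annotated with the term, N = background size.
from math import comb


def fold_enrichment(k: int, n: int, K: int, N: int) -> float:
    """Ratio of the term's frequency in the list to its background frequency."""
    return (k / n) / (K / N)


def hypergeom_upper_tail(k: int, n: int, K: int, N: int) -> float:
    """P(X >= k) for X ~ Hypergeometric(N, K, n)."""
    total = comb(N, n)
    return sum(
        comb(K, i) * comb(N - K, n - i)
        for i in range(k, min(n, K) + 1)
    ) / total


# Toy numbers: 2 of 3 listed genes carry the term; 4 of 10 background genes do.
print(round(fold_enrichment(2, 3, 4, 10), 4))       # 1.6667
print(round(hypergeom_upper_tail(2, 3, 4, 10), 4))  # 0.3333
```

A fold enrichment near 6, as reported for nuclear division in the KLF4-PR1 samples, would mean the term is about six times more frequent among the regulated genes than in the background.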

To further validate the gene expression profile, we chose 
several well-known transcriptional gene targets of Klf4 and 
c-Myc to verify the results seen in the microarray data by 
real-time PCR. We found that KLF4-PR1 saRNA transfection 
resulted in significantly increased expression of the Klf4 target 
genes cyclin D1, ornithine decarboxylase (ODC1), p21, and 
p53 (Figure 6a); KLF4 virus-transduced cells also showed 
significantly increased expression of cyclin D1 and p21 to 
a similar degree, although not of ODC1 or p53. Similarly, 

Table 1 Top 10 most significantly enriched pathways among those genes 
that were differentially regulated between KLF4-PR1 and KLF4 virus 
samples 

Pathway                                                               P value
Neurophysiological process: receptor-mediated axon growth repulsion   1.64 × 10^-?
Development: HGF-dependent inhibition of TGF-β-induced EMT            2.23 × 10^-?
Cytoskeleton remodeling: TGF, WNT, and cytoskeletal remodeling        3.68 × 10^-?
Transport: macropinocytosis regulation by growth factors              1.51 × 10^-?
Development: regulation of EMT                                        1.67 × 10^-?
Cell cycle: ESR1 regulation of G1/S transition                        3.18 × 10^-?
Cell adhesion: α-4 integrins in cell migration and adhesion           3.68 × 10^-?
Translation: regulation of EIF4F activity                             4.09 × 10^-?
Cell adhesion: plasmin signaling                                      4.23 × 10^-?
Development: TGF-β-dependent induction of EMT via SMADs               4.23 × 10^-?

Abbreviations: EMT, epithelial-to-mesenchymal transition; HGF, 
hepatocyte growth factor; TGF-β, transforming growth factor-β; SMAD, 
Drosophila Sma/Mad ortholog. 



Table 2 Top 10 most significantly enriched pathways among those genes 
that were differentially regulated between MYC-PR2 and c-MYC virus 
samples 

Pathway                                                                   P value
Transcription: role of Akt in hypoxia-induced HIF1 activation             4.66 × 10^-?
Apoptosis and survival: granzyme A signaling                              6.40 × 10^-?
Development: PDGF signaling via STATs and NF-κB                           7.75 × 10^-?
Normal and pathological TGF-β-mediated regulation of cell proliferation   8.49 × 10^-?
Development: regulation of telomere length and cellular immortalization   1.01 × 10^-?
Some pathways of EMT in cancer cells                                      3.02 × 10^-?
Cell adhesion: ECM remodeling                                             3.19 × 10^-?
GTP metabolism                                                            3.55 × 10^-?
Transcription: PPAR pathway                                               5.01 × 10^-?
Transport: macropinocytosis regulation by growth factors                  5.48 × 10^-?

Abbreviations: ECM, extracellular matrix; EMT, epithelial-to-mesenchymal 
transition; GTP, guanosine triphosphate; NF-κB, nuclear factor-κB; PDGF, 
platelet-derived growth factor; PPAR, peroxisome proliferator-activated 
receptor; TGF-β, transforming growth factor-β. 



Table 3 Top 10 most significantly enriched gene ontology terms among KLF4-PR1 and KLF4 virus samples from all genes with corrected P value <0.05 
and absolute fold change >2 

KLF4-PR1                       P value       Fold enrichment  KLF4 virus                     P value       Fold enrichment
Nuclear division               2.16 × 10^-?  6.160865987      M phase of mitotic cell cycle  1.15 × 10^-?  4.594941095
Mitosis                        2.16 × 10^-?  6.160865987      Mitotic cell cycle             2.25 × 10^-?  3.574354207
M phase of mitotic cell cycle  3.32 × 10^-?  6.077611041      Cell cycle phase               2.50 × 10^-?  3.396337559
Organelle fission              7.66 × 10^-?  5.917673909      Nuclear division               4.12 × 10^-?  4.535309559
M phase                        1.34 × 10^-?  4.848239763      Mitosis                        4.12 × 10^-?  4.535309559
Cell cycle phase               7.24 × 10^-?  4.220739263      Organelle fission              1.46 × 10^-?  4.356284182
Mitotic cell cycle             1.97 × 10^-?  4.399661906      M phase                        3.72 × 10^-?  3.62313405
Cell cycle process             3.21 × 10^-?  3.223883762      Cell cycle process             1.20 × 10^-?  2.81815595
Cell division                  6.05 × 10^-?  4.312307844      Cell cycle                     1.84 × 10^-?  2.402392917
Cell cycle                     1.62 × 10^-?  2.649960034      Cell division                  1.18 × 10^-?  3.275893776



MYC-PR2 saRNA transfection resulted in significantly 
increased expression of c-Myc target genes ODC1, p21, and 
p53, as did transduction with c-MYC virus (Figure 6b). 

Finally, to assess the ability of the KLF4-PR1 saRNA to 
activate the expression of other reprogramming factors, we 
analyzed mRNA expression of OCT4, SOX2, and MYC, as 
well as the stem cell marker NANOG, by real-time PCR. 
We found that KLF4 activation by saRNA was also able to 
activate expression of OCT4, SOX2, MYC, and NANOG 
(Figure 6c). Further analysis of OCT4 mRNA isoforms 
showed that OCT4A was significantly upregulated, 
whereas OCT4B showed no difference (Figure 6d). 

Conversely, the ability of MYC-PR2 saRNA to activate 
endogenous KLF4 gene expression was also analyzed. In 
response to MYC-PR2 transfection, there was an approximately 
twofold increase in KLF4 mRNA levels (Figure 6e). 
Notably, this effect of MYC-PR2 saRNA was corroborated by 
the microarray analysis (Supplementary Table S1). 



DISCUSSION 

We have been able to identify and characterize two saRNAs, 
KLF4-PR1 and MYC-PR2, that specifically activate transcription 
of endogenous KLF4 and MYC genes, respectively, in 



Table 4 Top 10 most significantly enriched gene ontology terms among 
MYC-PR2 samples from all genes with corrected P value <0.05 and 
absolute fold change >1.5 

MYC-PR2                                                     P value   Fold enrichment
Response to nutrient levels*                                0.00284   8.209112294
Response to extracellular stimulus*                         0.00422   7.350886918
Response to lipid                                           0.05296   35.93766938
Tube morphogenesis*                                         0.05605   7.640291915
Embryonic morphogenesis                                     0.06378   4.241823271
Response to nutrient*                                       0.06653   6.930836237
Regulation of hydrolase activity                            0.08013   3.850464576
Collagen metabolic process*                                 0.08119   23.10278746
Multicellular organismal macromolecule metabolic process*   0.08949   20.86703383
Response to retinoic acid                                   0.09499   19.60236511
Response to hormone stimulus*                               0.09875   3.515641569

*Indicates those that are also enriched in c-MYC virus samples. 



Figure 5 Differential gene expression of saRNA-transfected and virus-transduced samples compared to control. (a) Expression of
all genes with corrected P value <0.05 and absolute fold change >1.5 for KLF4-PR1 and KLF4 virus samples. (b) Expression of all genes
with corrected P value <0.1 and absolute fold change >1.5 for MYC-PR2 and c-MYC virus samples. (c) Expression of genes with stem
cell-related gene ontology, corrected P value <0.1, and absolute fold change >1.5 for KLF4-PR1 and KLF4 virus samples. (d) Expression of
genes with cell cycle-related gene ontology, corrected P value <0.1, and absolute fold change >1.5 for KLF4-PR1 and KLF4 virus samples.
(e) Expression of genes with cell cycle-related gene ontology, corrected P value <0.1, and absolute fold change >1.5 for MYC-PR2 and
c-MYC virus samples. saRNA, short-activating RNA.

primary human MSCs. Interestingly, in both cases, it was a
promoter-targeted saRNA that gave the desired effect. Given
that studies implicate antisense RNAs that overlap the gene
of interest as the targets for degradation in RNAa,^°" it is
conceivable that these promoter-targeted saRNAs are targeting
an antisense RNA that has not yet been discovered.
However, we were unable to identify any antisense transcripts
arising from the region of the KLF4-PR1 target site. Antisense
transcripts have been reported to arise from the vicinity
of the c-Myc promoter in human cells such as prostate
cancer cell lines, and we were able to identify one such
antisense RNA in primary human MSCs. However, this antisense
RNA does not overlap the MYC-PR2 target site and is
therefore unlikely to be the RNA target of MYC-PR2. Interestingly,
this antisense RNA does overlap the MYC-PR1 target
site, but cells transfected with MYC-PR1 saRNA did not show
increased transcription of nascent MYC mRNA. Hence it is
difficult to ascertain whether MYC-PR1 does target this antisense
RNA, perhaps resulting in increased accumulation of
MYC sense mRNA at a post-transcriptional level, or if the
observed upregulation of MYC mRNA levels by MYC-PR1
is an off-target effect. Due to the relatively small activation
of MYC, it would be difficult to completely rule out off-target
effects even with the use of more sophisticated techniques
such as chromatin immunoprecipitation analysis. Furthermore,
specific saRNA-targeted effects on MYC expression
by transcriptional or post-transcriptional mechanisms, and
coexisting off-target effects, are not necessarily mutually
exclusive. Although it may not be possible for every gene to
be activated by RNAa, our results suggest that this method
could be used to generate saRNA candidates for activation
of other endogenous genes for which a promoter-associated
antisense RNA has not yet been defined.

Figure 6 Validation of microarray gene expression of Klf4 and c-Myc target genes. (a) RT-qPCR of Klf4 target genes in KLF4-PR1
and Klf4 virus samples relative to control gene expression. (b) RT-qPCR of c-Myc target genes in MYC-PR2 and c-MYC virus samples
relative to control gene expression. (c) RT-qPCR for KLF4-PR1 activation of stem cell and reprogramming factors relative to control gene
expression. (d) RT-qPCR for KLF4-PR1 activation of OCT4 isoforms relative to control gene expression. (e) RT-qPCR for MYC-PR2 activation
of KLF4. Asterisks indicate statistical significance: *P < 0.05; **P < 0.01. con, control; RT-qPCR, reverse transcription-quantitative PCR.

Focusing on stem cell- and cell cycle-related genes, which
were expected to be altered by KLF4 and c-MYC expression,
we found the similarities between saRNA-transfected
and virus-transduced samples to be quite striking. This was
confirmed by GO analysis, as the most significantly enriched
terms were nearly identical in KLF4-PR1 oligo and KLF4
virus samples, and most of the top MYC-PR2 terms were
also significantly enriched in the c-MYC virus samples. In
addition, several well-known KLF4 and c-MYC target genes
were validated by real-time PCR, and showed a similar pattern
of expression in the saRNA-treated samples as in the
virus-transduced samples. Furthermore, we have shown
that KLF4 saRNA is able to activate expression of endogenous
pluripotency factors including OCT4, SOX2, NANOG,
and MYC. Activation of OCT4 was specific to OCT4A, which
has been reported to be essential for stemness in human
embryonic stem cells.^^ The highly specific nature of OCT4A
upregulation in response to transfection of saRNA targeting
KLF4 makes this likely to be a true downstream event
caused by specific activation of KLF4. Furthermore, the
saRNA-treated MSCs showed marked differences in cell
morphology, supporting our conclusion that this activation is
biologically relevant.

Interestingly, for some downstream gene targets, our
results were different from what was expected, as Klf4
is expected to downregulate expression of cyclin D1,
ornithine decarboxylase, and p53,^^ whereas c-Myc is
expected to downregulate p21.^^ However, these previously
observed results are often cell- and tissue type-dependent,
as evidenced by more recent reports showing
activation of cyclin D1 and p53 by Klf4,^'''^^ and activation
of p21 through p53 by c-Myc^** in different cell types. In
addition, our data are consistent across both saRNA- and
virus-treated samples in independent microarray and real-time
PCR experiments.

As with RNAi, a primary concern for using RNAa to study 
biological processes is the minimization of potential off-target 
effects. To evaluate possible off-target effects, we compared 
the gene expression profile of saRNA activation of KLF4 and 
MYC to lentiviral-mediated expression. Some significant dif- 
ferences were observed in the total gene expression profile, 
but this is perhaps not surprising, as one method employed 
transfection of saRNA and the other lentiviral transduction. 
Indeed, the use of any transduction method involving intro- 
duction of long stretches of exogenous nucleic acids, includ- 
ing both viral vectors as well as plasmids, will likely activate 
a variety of innate signaling mechanisms. This may result 
in unwanted upregulation of interferons and related genes 
that may not only cause protein synthesis shutdown and 
effects on cell proliferation, but may also impair normal stem 
cell function, as has been described in hematopoietic stem 
cells. Notably, our interferon response gene expression 
analysis as well as our microarray analysis of gene functional 
annotation and differentially regulated pathways showed no 
evidence of interferon response upregulation after saRNA 
oligo transfection. In contrast, we observed significant induc- 
tion of interferon responses upon viral transduction at a dose 
of 1 pg p24 per cell. This corresponds roughly to a multiplicity
of infection of 10, which is within the range of multiplicities
of infection typically used in reprogramming protocols.
Hence, it may be highly advantageous that RNA duplexes 
<23 bp in length will not cause induction of interferon,''^ while 
effecting transcriptional gene activation of endogenous pluri- 
potency factors. 

In fact, when we focused on the genes that were differ- 
entially expressed between the saRNA and virus samples, 
the most significantly enriched pathways implicated the use 
of lentiviral vector-mediated gene transfer as the cause of 
many of the expression profile changes that were discrepant.
Notably, cytoskeletal remodeling pathways, including TGF-mediated
remodeling, were upregulated in both KLF4 virus- and
c-MYC virus-treated samples. It is well known that retroviruses
manipulate the host cytoskeleton to facilitate virus
entry and integration."^ Furthermore, the presence of TGF 
signaling pathways is also not surprising, considering that 
carryover of HIV Tat can occur after packaging of second- 
generation lentivirus vectors, and that Tat protein has been 
associated with the induction of TGF-β,'"' which likely serves
multiple functions for the virus, including immunosuppression 
and facilitation of cytoskeletal remodeling. "^''^ The differen- 
tial regulation of macropinocytosis in both KLF4 virus- and 
c-MYC virus-treated samples is also likely due to viral entry,
as both native HIV- and vesicular stomatitis virus-G (VSV-G)- 
pseudotyped HIV vectors have been reported to use macro- 
pinocytosis for entry.''''*** HIV Tat protein has also been shown 
to enter cells by macropinocytosis."" 



Notably, several EMT regulation pathways were observed 
to be differentially regulated between saRNA- and virus- 
treated samples, irrespective of whether it was KLF4 or 
c-MYC being targeted or transduced. This was quite striking,
as it has been reported that suppression of EMT signals is
required for reprogramming mouse fibroblasts.'' Klf4 serves
to activate the mesenchymal-to-epithelial transition (MET)
that is required for reprogramming, whereas Oct4, Sox2, and
c-Myc suppress TGF-β-induced EMT. The differential regulation
of these pathways in our experiments fits well into this
model, as in both cases Klf4 and c-Myc are acting in competition
with the TGF-β induced by lentiviral infection. This suggests
that using lentiviral vectors to activate reprogramming
factors may actually hinder the reprogramming process, as
these vectors have been shown to activate TGF-β signaling.
This underscores the need for alternative methods of gene 
activation in reprogramming. 

In recent years, invaluable information has been obtained 
from the use of RNAi to study stem cell biology by inhibit- 
ing expression of specific genes involved in the regulation of 
pluripotency within embryonic as well as somatic stem cells. 
Here, we have shown the potential of using saRNAs that, 
conversely, upregulate expression of endogenous genes in 
stem cells. As each gene can be selectively targeted for acti- 
vation, the use of saRNAs may also provide a highly useful 
tool in studying the contribution of individual factors in iPS
cell reprogramming. Several studies have used inducible 
systems to study the reprogramming process,^"™ ''' but these 
have been limited by the inability to activate or repress the 
activity of each factor individually. Since RNAa is a transient 
process, it may be possible to develop an optimized protocol 
for iPS cell production wherein each factor can be activated
when needed by the transfection of its specific saRNA, and 
similarly removed when it is no longer necessary. 

In this context, current methods for upregulating KLF4 and
c-MYC require transfection^^ or viral transduction'^"' of KLF4
or c-MYC expression vectors into cells. As noted above, this
study suggests that the TGF-β induced by lentiviral vectors
may actually be detrimental to reprogramming. Further, oncogenic
reactivation of stably integrated c-MYC transgenes
poses a serious safety issue to the use of iPS cells.''^ In addition,
evidence that latent viral expression of reprogramming
factors impairs normal differentiation of iPS cells, and intolerance
to genomic damage caused by exogenous DNA or
transposon integration^""" further emphasize the need for a
method of iPS cell generation that uses endogenous cellular
processes and requires no foreign DNA. In this regard, while
reprogramming to full pluripotency has not, to date, been
demonstrated with this method, other groups have recently
shown saRNA-mediated upregulation of endogenous OCT4
in a breast cancer cell line," and endogenous KLF4 in prostate
cancer cell lines. Notably, downstream gene expression
and phenotypic changes induced by saRNA-mediated upregulation
of KLF4 in prostate cancer cell lines were reported to
be comparable to those obtained by retroviral vector transduction.
This is consistent with our results obtained in primary
human MSCs, and suggests that the use of synthetic
saRNA oligos may prove highly advantageous as a safe and
efficient alternative for upregulation of endogenous reprogramming
genes.

MATERIALS AND METHODS

Bioinformatics and saRNA design. The genes KLF4 and c-MYC
were analyzed to design saRNA molecules. Four steps
were used: (i) download target gene annotations; (ii) identify
antisense RNA target sequences; (iii) select promoter antisense
sequences; and (iv) identify candidate saRNAs. First,
the method downloads information about the target's genomic
location, orientation, and transcriptional structure from available
databases such as the RefSeq database at UCSC (University
of California, Santa Cruz). Second, given a database
of RNA transcripts with known read direction, such as the
UCSC Spliced EST track, our method searches the database
for transcripts that are antisense to and in the vicinity of the
target gene. More specifically, the method identifies antisense
transcripts that (i) overlap the target's promoter and the target
mRNA's 5' end; (ii) overlap the target mRNA; (iii) are at most
20-100 kb upstream of the target's transcription start site (TSS);
or (iv) are at most 20-100 kb downstream of the target's
polyadenylation site. The method uses these four criteria as
hierarchical filters such that if it finds antisense transcripts that,
for example, satisfy criterion (i), the method does not consider
the three other criteria. Third, based on the target's TSS, the
method downloads the antisense genomic sequence from a
fixed size region upstream and downstream of the TSS. The
typical region size used by the method is 500 nts upstream and
downstream of the TSS, but larger or smaller sizes can also be
used. Fourth, the method designs siRNAs that give effective and
specific downregulation of the antisense target sequence. The
method (i) uses a siRNA design algorithm, such as GPboost,^'^
to identify candidate effective siRNAs; (ii) removes all candidate
siRNAs with aaaa, cccc, gggg, or uuuu motifs and GC content
<20% or >55%; (iii) removes all candidates that have Hamming
distance <2 to all potential off-target transcripts; and (iv) returns
a given number of remaining non-overlapping siRNAs sorted by
their predicted siRNA knockdown efficacy. The method returns
the two highest scoring saRNAs for a given antisense target
sequence.
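
The sequence filters in the fourth step can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the function names and the example sequences are hypothetical, the GPboost scoring step is omitted, and the off-target rule is interpreted here as rejecting a candidate when any same-length window of any off-target transcript lies within Hamming distance <2 of it.

```python
from typing import Iterable

# Four-nucleotide homopolymer runs that disqualify a candidate.
HOMOPOLYMER_RUNS = ("aaaa", "cccc", "gggg", "uuuu")


def to_rna(seq: str) -> str:
    """Lower-case the sequence and write it in the RNA alphabet."""
    return seq.lower().replace("t", "u")


def gc_fraction(seq: str) -> float:
    """Fraction of G and C nucleotides in the sequence."""
    rna = to_rna(seq)
    return (rna.count("g") + rna.count("c")) / len(rna)


def hamming(a: str, b: str) -> int:
    """Number of mismatched positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("sequences must have equal length")
    return sum(x != y for x, y in zip(a, b))


def min_offtarget_distance(candidate: str, transcripts: Iterable[str]) -> int:
    """Smallest Hamming distance to any same-length window of any transcript."""
    cand = to_rna(candidate)
    n = len(cand)
    best = n  # upper bound, returned when there is nothing to compare against
    for transcript in transcripts:
        rna = to_rna(transcript)
        for start in range(len(rna) - n + 1):
            best = min(best, hamming(cand, rna[start:start + n]))
    return best


def passes_filters(candidate: str, off_targets: Iterable[str] = (),
                   min_gc: float = 0.20, max_gc: float = 0.55,
                   min_distance: int = 2) -> bool:
    """Apply the homopolymer, GC-content, and off-target filters in turn."""
    rna = to_rna(candidate)
    if any(run in rna for run in HOMOPOLYMER_RUNS):
        return False
    if not min_gc <= gc_fraction(rna) <= max_gc:
        return False
    return min_offtarget_distance(rna, off_targets) >= min_distance


# Hypothetical 19-nt and 20-nt candidates; the second contains an "aaaa" run.
candidates = ["augcuagcuaacguuagca", "augcaaaacuaacguuagca"]
kept = [c for c in candidates if passes_filters(c)]
```

In a full pipeline the surviving candidates would then be ranked by predicted knockdown efficacy and the top non-overlapping ones returned, as described above.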

Cell culture. Bone marrow-derived adult human mesenchy- 
mal stem cells (Lonza, Basel, Switzerland) were cultured in 
the manufacturer's media as instructed. The KLF4, MYC, and 
control duplex RNA oligonucleotides were transfected into 
MSCs using Lipofectamine RNAiMAX reagent (Invitrogen,
Carlsbad, CA) following the manufacturer's protocol with 30
pmol oligo to 1 µl reagent in a 24-well plate to a final oligo concentration
of 50 nmol/l. Transfections were performed every
other day for the duration of each experiment. The BLOCK-iT 
Alexa Fluor Red Fluorescent Control (Invitrogen) and Silencer 
FAM labeled Negative Control #1 siRNA (Applied Biosystems, 
Carlsbad, CA), which have no homology to any known gene, 
were used as negative controls and to assess transfection 
efficiency by fluorescence microscopy and flow cytometry. 
Images were taken at ×100 magnification on a Nikon TS100
microscope (Nikon Instruments, Melville, NY). 

Plasmids and lentivirus vector production. The plasmids pSin- 
EF2-KLF4-Pur and pSin-EF2-c-MYC-Pur were generated by 
cloning human KLF4 and c-MYC transgenes from plasmids 
pMXs-hKLF4 or pMXs-hcMYC,^'' respectively, into the
pSin-EF2-Pur lentiviral vector backbone. VSV-G-pseudotyped
second-generation lentivirus preparations were produced
using standard protocols; briefly, packaging plasmids 
pMD2.G, psPAX2, and transfer vector were cotransfected 
into 293T cells with jetPRIME reagent (Polyplus-transfection, 
New York, NY), and 48 hours later virus-containing super- 
natant was collected, filtered, and concentrated by ultra- 
centrifugation. Vector titers were determined by p24 ELISA, 
performed by the UCLA Virology Core. 

Quantitative reverse transcription-PCR. Total RNA was isolated 
from MSCs using the RNeasy Micro Plus Kit to remove gDNA 
(QIAGEN, Valencia, CA). RNA was reverse-transcribed to 
cDNA using the High Capacity cDNA Kit (Applied Biosystems). 
For nascent RNA analysis, experiments were performed using 
the Click-iT Nascent RNA Capture Kit (Invitrogen) according 
to the manufacturer's protocol with a 1 hour EU pulse before 
sample collection on each day of the experiment. Quantitative 
real-time PCR was performed using Taqman Gene Expression 
Master Mix (Applied Biosystems) on a MyiQ2 thermal cycler 
(Bio-Rad, Hercules, CA) according to the manufacturer's 
standard protocols. The Taqman primer sets used were as follows:
KLF4, Hs00358836_m1; POU5F1 (OCT4A and OCT4B
isoform), Hs00999632_g1; POU5F1 (OCT4A isoform),
Hs01895061_u1; POU5F1 (OCT4B isoform), Hs00742896_s1;
SOX2, Hs00602736_s1; NANOG, Hs02387400_g1;
MYC, Hs00153408_m1; CCND1, Hs00277039_m1;
CDKN1A, Hs00355782_m1; ODC1, Hs00159739_m1; TP53,
Hs99999147_m1; ACTB, Hs00357333_g1 (Applied Biosystems).
For interferon response gene expression, primers from
the Interferon Response Detection Kit (System Biosciences,
Mountain View, CA) were used for SYBR Green real-time PCR
with SsoFast EvaGreen Supermix (Bio-Rad). As suggested
by the manufacturer's protocol, samples were collected for
expression analysis 24 hours after saRNA transfection or viral
transduction. Experiments were performed in triplicate wells
with at least three replicate reactions per PCR. Expression of
β-actin mRNA was used as an internal control and samples
were normalized to the scrambled sequence control oligonucleotide
or untreated samples. Statistical significance was
determined by Student's t-test, with P values <0.05 considered
significant.
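
The normalization described above (β-actin as internal control, expression relative to the control sample) can be illustrated with a short Python sketch. The text does not name the quantification formula, so this assumes the widely used 2^-ΔΔCt calculation with ideal amplification efficiency; the function name and the Ct values are hypothetical and are not data from this study.

```python
from statistics import mean
from typing import Sequence


def relative_expression(ct_target_treated: Sequence[float],
                        ct_ref_treated: Sequence[float],
                        ct_target_control: Sequence[float],
                        ct_ref_control: Sequence[float]) -> float:
    """Fold change of a target gene versus control by the 2^-ddCt calculation.

    Assumes (not stated in the source) a doubling of product per cycle for
    both the target and the reference (e.g., beta-actin) assays.
    """
    # dCt: target Ct minus reference Ct, averaged over replicate reactions.
    d_ct_treated = mean(ct_target_treated) - mean(ct_ref_treated)
    d_ct_control = mean(ct_target_control) - mean(ct_ref_control)
    # ddCt: treated sample relative to the control sample.
    dd_ct = d_ct_treated - d_ct_control
    return 2.0 ** (-dd_ct)


# Hypothetical triplicate Ct values: the target crosses threshold one cycle
# earlier in the treated sample while the reference gene is unchanged.
fold = relative_expression(
    ct_target_treated=[24.0, 24.0, 24.0],
    ct_ref_treated=[18.0, 18.0, 18.0],
    ct_target_control=[25.0, 25.0, 25.0],
    ct_ref_control=[18.0, 18.0, 18.0],
)
```

With these made-up values the one-cycle shift corresponds to a twofold increase relative to control.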

Western blot. Cells were lysed and protein concentration 
was determined using Coomassie Plus Assay Reagent 
(Thermo Scientific, Waltham, MA). Each sample was loaded 
onto a NuPAGE Bis-Tris Gel (Invitrogen) at 30 µg/well and
electrophoresed and transferred according to the manufacturer's
specifications. Primary antibodies used were GKLF
(sc-20691; Santa Cruz Biotechnology, Santa Cruz, CA) and
c-Myc (sc-764; Santa Cruz Biotechnology). β-Actin primary
antibody (ab8227; Abcam, Cambridge, MA) was used as an
internal control and for quantitation. Protein was detected
using anti-rabbit HRP conjugated secondary antibody
(HAF008; R&D Systems, Minneapolis, MN), developed using
Immun-Star WesternC Reagent (Bio-Rad), and visualized on
a ChemiDoc XRS+ (Bio-Rad). Blots shown are representative
from three replicates. Protein quantitation was performed
using Image Lab software (Bio-Rad). Statistical significance
was determined by Student's t-test, with P values <0.05 considered
significant.

Microarray and data analysis. Total RNA was isolated from
treated MSCs as described. RNA was processed and hybrid- 
ized to a GeneChip Human Gene 1.0 ST array (Affymetrix, 
Santa Clara, CA) in triplicate by the City of Hope Microar- 
ray Core Facility (Duarte, CA). Data analysis was performed 
by the UCLA DNA Microarray Core Facility. Samples were 
normalized using the ExonRMA16 summarization algorithm
and filtered on expression percentile in the raw data (20-100%).
Differential expression analysis compared to control
samples was performed using an unpaired t-test with
asymptotic P value computation and Benjamini-Hochberg
multiple testing correction. Heatmaps were generated using 
hierarchical clustering using centroid linkage and Euclid- 
ean similarity measure. For pathway analysis, lists of genes 
differentially regulated between saRNA and virus samples 
were used to generate significantly enriched pathways in 
MetaCore (version 6.3 build 25177 by GeneGo). For GO 
analysis, lists of differentially expressed genes with the indi- 
cated adjusted P values and absolute fold changes were 
generated for the saRNA- and virus-treated samples ver- 
sus control. These lists were used to generate functional 
annotation charts using DAVID bioinformatic analysis with 
the GOTERM_BP_FAT category on a HuGene-1_0-st-v1 
background. 
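
As an illustration of the multiple testing correction and thresholding described above, the following Python sketch applies the Benjamini-Hochberg adjustment to a list of raw P values and then keeps genes that pass an adjusted P value cutoff and an absolute fold change cutoff. It is a minimal sketch rather than the analysis software used in this study; the function names, the gene labels, and the numbers are hypothetical, and fold changes are assumed to be signed (negative for downregulation).

```python
from typing import List, Sequence, Tuple


def benjamini_hochberg(pvalues: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def select_genes(records: Sequence[Tuple[str, float, float]],
                 max_adj_p: float = 0.05,
                 min_abs_fc: float = 1.5) -> List[str]:
    """Keep genes with adjusted P < max_adj_p and |fold change| > min_abs_fc.

    Each record is (gene label, raw P value, signed fold change).
    """
    adj = benjamini_hochberg([p for _, p, _ in records])
    return [gene for (gene, _, fc), q in zip(records, adj)
            if q < max_adj_p and abs(fc) > min_abs_fc]


# Hypothetical records, not data from this study.
records = [("GENE_A", 0.005, 2.0), ("GENE_B", 0.01, -1.8),
           ("GENE_C", 0.03, 1.2), ("GENE_D", 0.04, 3.0)]
adjusted = benjamini_hochberg([p for _, p, _ in records])
selected = select_genes(records)
```

The two cutoffs are exposed as parameters because the figures and tables above use different corrected P value thresholds (<0.05 and <0.1) with the same fold change threshold.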
Supplementary Material

Figure S1. Transfection efficiency of red fluorescent control oligonucleotide by flow cytometry in MSCs.
Figure S2. Transfection efficiency of positive control siRNAs as evaluated by RT-qPCR for expression levels of targeted genes.
Figure S3. Replicates of western blots from Figure 2c,f.
Figure S4. RT-qPCR of Klf4 and c-Myc virus transduction in MSCs for interferon response genes at the indicated p24 amount per cell.
Figure S5. 5'-RACE for identification of promoter-associated antisense RNAs.
Figure S6. PCR to identify promoter-associated antisense RNAs.
Table S1. Normalized microarray expression data for each individual replicate used in this study.
Table S2. List of genes with stem cell-related gene ontology, used to generate Figure 3c.
Table S3. List of genes with cell cycle-related gene ontology, used to generate Figure 3d-f.
Materials and Methods.